# Multimodal Document QA

A 4-bit quantized version of GLM-4V-9B that supports multimodal, multilingual understanding with memory usage under 9 GB. The listing describes it as outperforming several mainstream models, without naming them.

Transformers, Supports Multiple Languages
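The memory claim in the entry above can be put in context with some rough arithmetic. The sketch below estimates the size of the weights alone at 16-bit and 4-bit precision. It assumes a nominal 9 billion parameters, read off the "9B" in the model name. The real parameter count, plus activations, quantization metadata, and any layers kept at higher precision, will change the total, so this does not reproduce the listing's "under 9 GB" figure.

```python
def weight_memory_gb(num_params: float, bits_per_param: int) -> float:
    """Approximate memory for model weights alone, in GB (1 GB = 1e9 bytes)."""
    return num_params * bits_per_param / 8 / 1e9


# Assumption: nominal parameter count taken from the "9B" in the model name.
NOMINAL_PARAMS = 9e9

fp16_gb = weight_memory_gb(NOMINAL_PARAMS, 16)  # half-precision weights
int4_gb = weight_memory_gb(NOMINAL_PARAMS, 4)   # 4-bit quantized weights

print(f"16-bit weights: {fp16_gb:.1f} GB")
print(f" 4-bit weights: {int4_gb:.1f} GB")
```

Under that assumption, 4-bit weights take about a quarter of the space of 16-bit weights, which leaves headroom within a 9 GB budget for the overheads the estimate ignores.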

Layoutlm Invoices

A multimodal LayoutLM model fine-tuned for question answering on invoices and other documents. It supports recognizing discontinuous text, meaning answers whose words are not adjacent on the page.

Image-to-Text, English
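To illustrate what "discontinuous text" means in the entry above, the following toy sketch compares an answer built from non-adjacent token positions with the text a single start/end span would return. The tokens, the positions, and both helper functions are hypothetical and written only for illustration. This is not the model's code and makes no claim about how the model selects tokens.

```python
# Hypothetical OCR tokens in reading order (not from a real invoice).
tokens = ["Bill", "to:", "Acme", "Corp", "Invoice", "#", "1042", "12", "Main", "St"]

# Assumed answer positions for "What is the billing address?".
# Positions 4-6 (the invoice number) sit between the two parts of the answer.
answer_positions = [2, 3, 7, 8, 9]


def contiguous_span(tokens, positions):
    """Text returned if the answer must be one unbroken start/end span."""
    return " ".join(tokens[min(positions): max(positions) + 1])


def discontinuous_answer(tokens, positions):
    """Text returned if the answer may skip over unrelated tokens."""
    return " ".join(tokens[i] for i in sorted(positions))


print(contiguous_span(tokens, answer_positions))
print(discontinuous_answer(tokens, answer_positions))
```

The single-span version drags the invoice number into the address, while the discontinuous version keeps only the selected tokens.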

Layoutlm Invoices

A document QA model fine-tuned from the LayoutLM architecture, designed to recognize discontinuous text in invoices and other documents

Transformers, English

Layoutlm Invoices

A document QA model fine-tuned from the LayoutLM architecture for question answering on invoices and other documents

Transformers, English


© 2025 AIbase